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Abstract 

We propose a novel trust metric for social networks which is suitable for application to 
recommender systems. It is personalised and dynamic, and allows to compute the indirect 
trust between two agents which are not neighbours based on the direct trust between agents 
that are neighbours. In analogy to some personahsed versions of PageRank, this metric makes 
use of the concept of feedback centrality and overcomes some of the limitations of other trust 
metrics. In particular, it does not neglect cycles and other patterns characterising social net- 
works, as some other algorithms do. In order to apply the metric to recommender systems, 
we propose a way to make trust dynamic over time. We show by means of analytical approx- 
imations and computer simulations that the metric has the desired properties. Finally, we 
carry out an empirical validation on a dataset crawled from an Internet community and com- 
pare the performance of a recommender system using our metric to one using collaborative 
filtering. 

Keywords: Trust, Social Networks, Recommender Systems, 
Personalisation, Information Overload 

1 Introduction 

An increasing number of information technologies focuses on how web users can effectively share 
opinions about various types of products, services or even other users. These technologies are the 
basis of several types of Web 2.0 applications such as collaborative tagging, social bookmarking 
[Ull] and, in particular, also recommender systems. Given the heterogeneity of web users, a major 
issue is how to appropriately aggregate opinions in order to provide judgements that are useful 
for each individual user. 

Most of these applications use collaborative filtering algorithms which compute an index of 
similarity between users or between items, based on the ratings that users have provided on 
these items [8l[l2l[19]. When a user belongs to a community with common, shared tastes, these 
algorithms work well in suggesting new items similar to the ones the users have already rated. 
There are several other benefits: except providing enough ratings, no further action is required 
of users; algorithms for collaborative filtering are scalable (when similarities are computed across 
items [22]); and, finally, they provide some level of personalisation. A shortcoming is that if users 
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are looking for items which are seldomly rated by their community, the predictions are poor - 
e.g. people who have rated only travel books may not receive very good recommendations on 
tools for gardening. 

To cope with this, a line of research has focused on basing recommendations for users not on their 
similarity, but on their trust relations to other users. In this context, trust is meant to be the 
"expectancy of an agent to be able to rely on some other agent's recommendations" [151 [25]. There 
has been a body of work on "trust webs" [U [TOl [III [21] and on their application to recommender 
systems [7l[l6l[18]. The small- world property of social networks [20] allows to potentially reach 
a lot of information, while the trust allows to filter out the relevant pieces [25]. The benefits 
of these trust-based algorithms include strong personalisation, no need to have a long rating 
history in the system because recommendations are not based on similarity, and the ability to 
receive recommendations on items different from the ones already rated. Some limitations of the 
trust-based approach concern the scalability and the fact that, in addition to their ratings of 
items, users have to provide information about their level of trust to some other users. 

In this paper, we introduce a novel metric for trust in social networks. A trust metric allows to 
compute the indirect trust between two agents in a social network which are not neighbours, 
based on the direct trust between agents that are neighbours. While it is intuitive to do this 
on a chain, e.g. from user A via user B to user C, for instance by multiplying the values of 
trust along the chain, it is not a priori trivial how to proceed when a graph contains multiple, 
redundant paths, cycles, or triangles (because of mathematical issues related to uniqueness and 
consistency). This is a crucial issue because these patterns all play an important role in social 
networks, in particular for the diffusion of information and the build-up of social capital [271124] . 
Some trust metrics address these issues by reducing the direct trust graph to an acyclic graph 
before applying their computation of indirect trust [7], [IB]. Other metrics use only the path of the 
shortest distance or of the highest trust [22]. Our trust metric takes all the paths in the graph 
into account and it is well-defined on any given graph. It provides each user with personalised 
trust ratings about other users in the network. Our metric also is dynamic, i.e. it evolves in time 
depending on how useful the information received by users is to them. This makes the metric 
suitable for application in recommender systems, as we will illustrate in the remainder of the 
paper. 

2 Background and Motivation 

Consider a scenario in which there is a social network of agents which have trust relationships 
among each other. This can be described by a graph in which the nodes represent the agents and 
the links represent the trust relationships. There also is a set of objects which can be rated by 
agents. Since each agent only knows a few objects, it may want to know other agent's opinions 
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on unknown objects. However, since there are potentially many opinions of other agents, it needs 
to be able to determine which of these are trustworthy. This implies that an agent needs to 
reason about the trustworthiness of other agents [25]. However, since its time and resources are 
constrained, an agent can only build and maintain trust relationships with a limited number of 
other agents. 

Thus, if Tij G [0, 1] represents the level of direct trust of agent i towards j, how do we compute 
the indirect trust fi^i between two agents k and I that are not neighbour^? 

In the following, we will describe the TrustWebRank metric for computing indirect trust in 
a network with direct trust. This metric builds on the concept of feedback centrality which 
assigns a centrality score to the nodes of a network based on the centrality scores of the node's 
neighbours. In other words, in feedback centrality, the higher (or lower) the centrality score of 
a node's neighbours, the higher (or lower) this node's own centrality is. These principles can 
be adapted to define a metric for the trustworthiness of agents in a social network with trust 
relationships. 

We briefly review PageRank, one of the most widely known and studied feedback centrality 
algorithms [11[3]. In our scenario this would mean to assign a trustworthiness score q to an agent 
i that depends on the trustworthiness of its neighbours j (adapted from [3]): 

^^ = /5 E ^ + (1-^) ^^' (1) 

where Ni is the set of neighbours of i, and /3 is a damping factor which is chosen around 0.8 [1]. 
In vector notation: 

c = pPc+il- (2) 

where P is a stochasticEl transition matrix defined as 

if there exists a link from j to i 

I AT, I 

otherwise. 

Eqs. ([I]) and ([3]) can easily be extended to weighted graphs [3]. Solving Eq. ([2|) for c we obtain: 

c = iI-(3PrHl-(3)l, (4) 

where / is the identity matrix and 1 is the vector consisting of ones. Since P is, by construction, 
stochastic and thus, by the Perron- Frobenius theorem [23], the largest eigenvalue is Apf(-P) = 1, 



^Variables expressing indirect trust are as the corresponding ones expressing direct trust, but with a tilde 
symbol: e.g. T and T. 

^We will always assume row-stochastic when we state "stochastic"; this does not imply that the matrix need 
(or not) to be column-stochastic. 
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it follows that Xpf{PP) =/?<!. This ensures the existence of a unique solution of c. Usually, 
one uses Jacobi iteration to compute such a solution. 

The result of applying this algorithm to a graph is a vector which gives a score of the trust- 
worthiness Cj for each node i in the graph. Note that this is a global metric, i.e. there is one 
score for each agent. It has been observed in the literature that, for recommender systems, such 
metrics are often not appropriate and that local metrics, which are personalised for each agent 
("how trustworthy is agent i from the perspective of agent j"), are required [E]. EigenTrust, for 
example, is a PageRank-inspired, global trust metric [H]. 



3 A Novel Trust Metric 
3.1 From Centrality to Trust 

Proceeding in analogy to PageRank and using the principles of feedback centrality to construct 
a personalised metric for trust, one could define the indirect trust of agent i to j as the indirect 
trust of the neighbour agents k of agent i to agent j, weighted by the trust of agent i towards 
these neighbour agents k. Let T be the trust matrix, where Tij £ [0, 1] reflects the direct trust 
from agent i to agent j [Tij = if there is no link between agent i and agent j). S is the stochastic 
matrix 

Tij 

where Ni is the set of neighbours of agent i. 5 is a normalisation of T. We define Tij to be the 
indirect trustworthiness score from i to j: 

Tij = ^ Sikfkj \/i,j (6) 

This allows us to estimate the trust between any two agents i and j: if there is a link between 
i and j, Tij reflects the trust between them; if there is no link between i and j, Tij reflects the 
trust between them. Notice that this definition is similar to to the approaches used in [Illll]. In 
matrix notation, this is the recursive definition 

f = Sf (7) 
Notice that this approach has several limitations: 

1) Uniqueness of the solution: Let u^j be one column of T, i.e. the vector that expresses how 
much agent j is trusted by other agents. Then, Eq. ([7]) gives 

v^^j = Sv^j Vj. (8) 



4/[l9] 
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If S is acyclic [23] (i.e. the underlying graph is so), then there is a unique solution of Eq. ([8]). If 5" 
is not acyclic, it can be either primitive or non-primitive [13]. If S is primitive (and stochastic), 
there is a unique solution of Eq. ([8]), a vector with all components being identical [23]. This 
would imply that all agents i would trust agent j equally, which is obviously not desirable. If S 
is not primitive, there are multiple solutions for Eq. ([8]), which also is not desirable. 

One way of dealing with this could be to make S acyclic, for example by constructing a tree with 
a breadth-first search (BFS) from a chosen node, as for example [TJ [16] do. The BFS selects one 
node as a root, and from there on, explores the neighbours of the nodes, proceeding in levels 
1, 2, 3, . . . and removing links within a level and links from level k to level I where I < k at each 
step. However, this entails further limitations: 

Social networks are characterised by a high clustering coefficient [271 EHl [21]. By making the 
underlying graph of a social network acyclic, one removes the links within each level and the 
links from levels k to / where I < k, thus making the clustering coefficient 0. This implies that, 
subsequent to this procedure, the trust metric will not be able to differentiate well between 
regions of high clustering (thus, possibly high trust) and regions with lower clustering (thus, 
possibly lower trust) as on the original graph. 

Further, depending on which node is chosen as the root of the BFS, the acyclic graph will be 
different. This is not a problem in a decentralised scenario, when the computation is spread 
over many nodes. In this case, each node computes its own set of v^j by being root of its own 
breadth-first exploration. However, this is a problem in a centralised scenario, where such an 
approach is not scalable and also not mathematically tractable: as a result of a BFS rooting at 
each i, the computation uses a different matrix T for each node. 

2) Combination of direct and indirect trust: The metric defined in Eq. ([6]) is not able to account 
properly for the following situation: consider an agent i that trusts a neighbour agent j with 
intermediate level of trust, e.g. Tij « 0.5, because it does not yet know this agent well. If many 
of the other neighbours of agent i trust agent j, this should increase the trust between agent i 
and j. This does not happen with the current definition of trust. 

3) Normalisation of trust: another property, resulting from Eq. ([5|), is that the normalisation 
removes knowledge from the system. If an agent i trusts n neighbours equally, it does not matter 
whether it trusts them a lot or a little in [0, 1] - the normalisation would assign the same value of 
trust of ^ to each of the neighbours. Then, during propagation, only the relative trust compared 
to other neighbours is considered. Equally, suppose that an agent i has just one neighbour agent 
j - no matter whether i trusts j highly or lowly, in each case the normalisation would cause the 
trust from i to j to be 1. The normalisation is necessary, however, to have values of direct and 
indirect trust which are in the same range. 
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3.2 The TrustWebRank Metric 

Thus, given these hmitations, can we modify Eq. ([6]) in such a way that the following requirements 
are met? 

Requirement 1: The solution of the equation over graphs with cycles is unique, but not trivial. 

Requirement 2: The range of indirect trust is the same as for direct trust, i.e. [0, 1], so that 
direct and indirect trust can be compared. 

Requirement 3: In the metric, direct trust "adds on" to indirect trust (capturing the fact that 
it complements it). 

One possibility to address these issues is the following: we compute the indirect value of trust 
between two agents i and j based on the direct trust between them, if there is any, but also 
based on the trust that the neighbours of i have in j: 

fij = Sij + /? ^ Sikfkj (9) 

where f3 G [0, 1). Now, in matrix form Eq. ((H) is 

f = S + pSf, (10) 

and, using elementary algebra, we can derive 

f = {I-pSy^S. (11) 

There exists a unique, non-trivial solution to Eq. (fTTl) if Apf(/35') < 1, [13]. Since S is stochastic, 
i.e. Apf(5') = 1, and (3 G [0, 1), it follows that Xpf{(3S) < 1 (Requirement 1). 

The parameter (3 has a similar role as the damping factor in PageRank in Eq. ([T]): given (3 G [0, 1), 
the impact of agents far away in the social network is discounted. This can be seen more clearly 
when expressing (1 — I3S)~^ as a geometric sum in Eq. ([TT]) [T3] : 



oo 

f = (1 - pS)-^S = {PS)''S = S + PS^ + P^S^ + ... (12) 

k=0 



The kth power of the adjacency matrix of a graph gives the number of walks of length k between 
any two nodes in the graph. Similarly, the fcth power of the matrix S gives the sum of the 
products of the weights along all walks of length k in the underlying graph of S. In Eq. ([T2]l . 
the higher the length of the walks, the stronger the discount (since /? < 1). As in PageRank, a 
reasonable value of f3 turns out to be around 0.75 to 0.85 (see Section [13]) . Note that Tjj ^ [0, 1]. 
We can normalise it to 

= (13) 



6/119] 
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to ensure the comparability of values of direct and indirect trust (Requirement 2). 

Furthermore, if agents i and j are not neighbours, the indirect trust of i to j is entirely based on 
how much the neighbours of i trust j. However, if agent i has a neighbour j, the indirect trust of 
i to j will also incorporate how much the other neighbours of agent i trust or do not trust agent 
j (Requirement 3). 

The definition of Eqs. ^ and (fTOl) naturally takes the real structure of a social network into 
account without needing to prune any link. Unlike to what would happen during the conversion 
of the underlying graph to a tree using a BFS, the algorithm preserves the links which, in a social 
network, lead to a high clustering coefficient, and are not negligible when reasoning about the 
social network itself [271[20l[2i] . 

When dealing with huge graphs, however, inverting a matrix as required by Eq. (fTTI) poses an issue 
of computation time and memory. Yet, instead of inverting a matrix or computing eigenvectors, 
it is possible to use an iterative method [3] as follows: 

At each step k, one only needs the neighbourhood Ni of a given agent i, as well as access to 
the matrix of T^^"^) computed at the previous step k — 1. Notice that now we are computing a 
matrix while, with the centrality, e.g. in PageRank, we were computing a vector. This is natural 
since the centrality is one value per agent (it is a global notion), while trust is a value per pair 
of agents (it is a local, personalised notion). Therefore computing trust {'-^ 0{N'^)) is inherently 
more expensive than computing centrality (~ 0{N)). However, do we really need to compute 
indirect trust among all agents? In fact, for a given agent i, computing the trust to a selected 
amount of other agents j, if well chosen, will be sufficient, as the trust to agents far away in 
the network will be damped out anyway. So, the scalability of the trust computation rather is 
(~ 0{mN)), where m is the number of other agents j to consider for each agent i. 

4 An Application of the Metric 

So far, we have described a trust metric which allows to compute a measure of trust between two 
agents which are not necessarily neighbours in a social network. We will now construct a simple 
model which applies this metric in the context of a recommender system. The purpose is to show 
how it is possible to compute predictions of how an agent i likes a particular object o (suppose 
a book, CD, or movie) based on how other agents j liked that item combined with how much i 
trusts j. 



7m 
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4.1 A Simple Model 

Suppose we have a system of agents embedded in a social network, defined by a graph and 
associated to an adjacency matrix A. Each agent i keeps track of its trust relationships to 
neighbours j. These are reflected in the matrix of direct trust T. Obviously, Tij > only if 
Aij = 1. For the moment, we take the network to be described by a random graph [6l[2] in which 
each agent roughly has the degree d. 

Let each agent i be characterised by a profile tTj. The profile expresses which ratings an agent 
would give to all possible objects; however, agents only know a subset of their ratings on objects. 
Given an object o, r° G { — 1,1} is the rating of agent i on object o. If an agent is willing to 
share all its opinions with other agents, then the set of all of its ratings corresponds to its profile; 
however, there may be agents which are not willing (because they want to keep their secrets) or 
able (because they simply do not know particular objects) to share ratings. This can be captured 
by a parameter rj which reflects the probability of an agent to share - i.e. signal - its rating with 
other agents. E.g., a value of r] = 0.1 would imply that, on average, at each time step 10 agents. 
At the moment, t] is the same value for all agents, but it could also be set differently for each 
agent i or even for each pair of agents i and j. 

If an agent i is not willing or able to share its rating for an object o, the system computes a 
prediction p° as follows: 



i.e. the prediction for an agent i is the sum of the ratings of all neighbours j weighted by the 
indirect, normalised trust that agent i has in these neighbours j. 

Note that this bears resemblance to Collaborative Filtering (CF) [HI [E] in which the prediction 
for an agent i is also computed as a weighted sum of the ratings of all neighbours j (not neighbours 
in a graph-theoretic sense, but neighbours in terms of similarity of ratings). The more similar 
a neighbour, the more influential its rating will be for the prediction. In our case, making a 
prediction based on the ratings of the trusted neighbours implies that we make the assumption 
that agents who are connected by trust have similar mind-sets. Notice that this does not imply 
that they have rated the same items - for example, one user could appreciate the knowledge 
of another user in gardening, even though his own domain are travel books. Thus, unlike the 
similarity that could be computed e.g. by Pearson correlation, this notion of similarity extends 
not just across rated items, but rather is an "expected" similarity reflecting a similar mind-set of 
two agents. 




(15) 



SO p° G [—1, 1], since X^jeA^i ~ ^ ^^'^ ^ {~^^ vector notation, 

p = Sr, 



(16) 
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4.2 Trust Dynamics 

So far, we have a static model which, based on the trust web of a particular agent i and the 
ratings of its neighbours j, is able to compute predictions p° for that agent. We now would 
like to model the evolution of the trust network over time in the sense that, based on the quality 
of a particular recommendation, agent i can update its trust to its neighbours j. This adds a 
time dimension to the model and requires a mechanism to update the trust between neighbours. 
This can be done by adding a utility function: agents experience a utility by using the ratings 
or predictions of neighbours and then the trust update is coupled with the utility experienced. 
We define each agent i to experience a utility Uij{t) by following the recommendation from each 
neighbour j at time t as follows: 

J 1 - \r°{t) - r°{t)\ if j signals to i 
' \ l-|rf(t)-p°(t)| otherwise. ^ ^ 

Note that Uij{t) G [—1,1]. If the neighbour j signals to agent i, it knows the rating r°j{t); 
otherwise, it only knows a prediction p°j{t). The closer the recommendation of agent j for agent 
i to the rating of agent i is, the greater the agents' similarity is and thus the higher the utility 
Uij{t) that agent i experiences from the recommendation of agent j at step t is. Note that because 
of the level of cooperation rj - which affects whether agent j signals to i - the utility takes into 
account not only similarity [28], but also cooperation between agents. Based on the utility, agent 
i can update the trust towards its neighbour agents j. We distinguish four cases, based on the 
sign and the magnitude of the utility: 

• If the sign is positive, this means that the rating or prediction of a neighbour was good; if 
it is negative, it means that the rating or prediction was bad. 

• If the magnitude is large, the neighbour had a lot of trust in the rating/prediction of its 
own neighbours; if it is small, the neighbour had little trust in the rating/prediction of its 
own neighbours. 

This leads us to the following definition of how an agent i updates its trust to agent j from time 
t to t + 1: 



7r,,-(t) + (i-7)K-(t)| 

(18) 



if Uij{t) > Uthr or - Uthr < Uij{t) < 



^T,j{t)-{l-^)\ui,{t)\ 

if Uij{t) < -Uthr or < Uij{t) < Uthr 

where we take uthr = 0.5 and 7 € [0, 1] is a parameter that controls the relative weights of the 
current history of trust between two agents, and of the current utility, Uij{t). For 7 > 0.5, 
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Figure 1: Social network of agents and trust build-up over time in case of a fraction of agents 
not signalling as well as cycles in the underlying network: there are two profiles, red and blue, 
indicated by the cores of the node. Only square nodes are signalling; e.g., nodes 2, 3, and 4 are 
not signalling. There are two cycles from 4 to 7 and 8, respectively, to 9 and then to 4. After a 
few steps, the nodes learn which other nodes to trust. 



this gives the history of trust more weight than the current utility. In the analysis and simulations 
(next section), we found that 7 = 0.75 is a reasonable value. Since Tij € [—1,1], but we want 
Tij G [0, 1], we cap it to [0, 1]: 

Tij{t + 1) = max(0,min(l,fij)). (19) 

As an example, the effects of these dynamics are illustrated in Figured] this is an example of a 
network of agents having two profiles (red and blue). Some nodes are signalling (squares), others 
are not (circles). The network contains cycles. At t = 1, the agents are just connected, the trust 
between all agents is equal to zero. At t = 2, agent 3 and agent 4 have received recommendations 
from agents 5 and 6, and from agents 7 and 8, respectively. Since agent 3 (4) has the same profile 
as agents 5 and 6 (7 and 8), namely red (blue), it perceives a high positive utility from the 
recommendation and thus increases its trust to the recommending agents. At t = 3, the system 
can now provide a recommendation to agent 2, even though agents 3 and 4 are not signalling 
their own rating. Since agent 2 has the same profile as agent 3, trust between these two agents 
increases. Agent 2 perceives a high negative utility from the recommendation of agent 4, thus its 
trust remains zero. At the same time, the links from 3 to 5 and 6 reinforce. The same happens in 
the cycles. These mechanisms continue and we see that at t = 5, paths of trust have developed 
between agents of the same profile. Although agent 1 has no agents of its profile that are signalling 
in one or two levels of distance, it is still able to discover a path to two agents of its profile that 
are signaling and further away in the network. 



4.3 Analysis of the Model 

In this section we derive a self-consistent equation for the matrix of trust which allows to investi- 
gate the dynamics of trust. We analyse the case of a population of agents with only two opposite 
profiles (see Section H?T]) which provide ratings on objects as +1 or —1, respectively. 



lO/dl] 
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We want to compute the expected value of trust at the equihbrium of the dynamics defined in 
Eqs. (fT8]l and (fTOl) . We do so by a mean- field approximation in which we replace the utility Uij{t) 
in Eq. ([TS]) with the expected utility over time, denoted by Uij := E{uij(t)) (without the time 
dependency). We impose Tij{t) = Tij(t + 1) at the equilibrium, obtaining 

Tij = max(min(njj, 1),0), (20) 

which requires us to estimate Uij. Given the definition of Uij{t) in Eq. (flTl) and the fact that 
agents signal a rating with probability 7] and they do not with probability 1 — 77, it follows that 
the expected utility Uij is 

Uij = 77(1 - IvTi - TTj\) + (1 - 7]){l - \-Ki - ^Sjk-Kk\)- (21) 

k 

Since we are considering the simple case in which agents signal faithfully, the expected rating 
provided by an agent j coincides with its profile: E{r°) = vTj. We can thus express the ex- 
pected prediction for agent j as E{p^) = Ylk^jk'^k- In future work, we will also consider more 
complicated cases, e.g. including non-faithful (selfish or malicious) behaviour. Substituting into 
Eq. ([20]), we get: 

Tij = max(0, min(l, r/(l — |7rj — tTjI) 

+ (1 -??)(!- |7r,-^5jfe7rfc|))). (22) 

k 

Since the profiles vr are given, T is a function of S. Notice that by combining Eqs. p2m3l l5]l. 
we can express Sjk in terms of the components Tj^, {T'^)jk, {T^)jk,--- as well as Tji, {T'^)ji, 
{T''^)ji,... where I are the other neighbours of 7: 

It follows that we can express the value of trust Tij between any pair of agents in terms of the 
value of trust among the other pairs. This leads to a self-consistent equation for T, where the 
only parameters are the initial values of trust T(0), the probability to signal, 77, the discount 
factor along the walks of the graphs, /?, and the profiles of the agents, n: 

Tij=f{T,T{0),r,,P,7r)yi,j. (24) 

Notice that Eq. ([241) is obtained without any assumption on the structure of the network that is 
reflected in T. 

One is, of course, interested in the fixed points of Eq. ([2^ . their stability and whether they 
are attained by the dynamics. On the one hand, it is trivial to check that the matrix T with 
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Tij = 1 among agents with the same profile and Tij = among agents with opposite profile is a 
fixed point of Eq. ([2l|) . Denote this configuration as {T"*" = 1,T~ = 0}. On the other hand, the 
configuration with trust equal zero among all pairs {r+'~ = 0} is not a fixed point. 

In the next section, we find, by means of computer simulations, that the system, starting from a 
configuration with no trust among the agents, {T'^'^ = 0}, always evolves to a configuration in 
which agents with similar profile trust each other {r+ = 1,T~ = 0}. This is true even if agents 
do not signal all the time (i.e. rj < 1). A formal investigation of the stability of all the fixed 
points of Eq. (f2^ will be performed in future work. 

4.4 Simulations 

The simulations that we carried out were done on an agent population of 500 agents. We con- 
sidered two opposite profiles with ratings on objects as +1 or —1. The agents are connected in 
a random graph [6j[2]. Initially, Tij = Vz,j, i.e. the agents have to learn who to trust. We 
varied the average degree d of each agent, as well as the level of cooperation r] in the system. The 
following figures illustrate the system behaviour over 50 steps; all results were averaged over 100 
runs. 



time t 

Figure 2: Trust between agents of the same profile over time, for a fixed average degree of 
agents but variable level of cooperation. 

Figure [2] illustrates the average trust between agents of the same profile over time: the average 
degree of agents is fixed, d = 7, and the level of cooperation t] is variable, ranging from 0.01 
to 0.25 in steps of 0.01. The average trust between agents of the same profile converges to 1 
for almost all rj. For larger t], this process takes place much faster than for smaller rj. Given a 
sufficient level of cooperation in the system, the agents develop trust to the agents that have the 
same profile. Furthermore (this is not shown in the figure), agents of opposite profiles do not 
develop trust between each other. 



d=7 




50 
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t=5 t=10 




average degree d of agents average degree d of agents 



Figure 3: Trust between agents of the same profile as a function of level of cooperation and 
average degree of agents at t = 5 (left), and t = W (right). 

Figure [3] illustrates the trust between agents of the same profile as a function of the level of 
cooperation and the average degree of agents at t = 5, and t = 10. Initially, at t = 0, agents 
still have to learn who to trust (and the whole figure would be blue, corresponding to zero trust 
between everyone). At i = 5, trust is already developing; for larger average degrees of agents d as 
well as for larger levels of cooperation r], this happens faster. At t = 10, trust between agents of 
the same profile has developed for an average degree of agents d > 5 and a level of cooperation 
r] > 0.05. 

The obvious consequence of the evolution of trust is that predictions tend to match the profiles. 
We test this by measuring the performance of the system. Let the performance be defined as the 
sum of the products of the utility and the trust between all pairs of agents i and j: 

where n is the number of agents, e.g. in our case n = 500. Agents are exposed to ratings which 
lead to both positive or negative utility. By building trust, they give more weight to the positive 
utility and less weight to the negative utility. Therefore, this measures "how well agents use their 
trust". 

Figure [4] illustrates the performance over time: again, the average degree of agents is fixed, 
d = 7, and the level of cooperation r/ is variable, ranging from 0.01 to 0.25 in steps of 0.01. 
The performance converges to 1 for almost all d. The similarity to Figure [2] is due to the fact 
that agents who have developed trust to other agents of the same profile are provided with good 
recommendations from their neighbours; thus, these agents perceive high utility which leads to 
high performance. 
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Tl=0.1 




time t 

Figure 4: Performance over time, for a variable average degree of agents, but a fixed level of 
cooperation. 



t=1 t=5 
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Figure 5: Performance as a function of level of cooperation and average degree of agents at t = 
1 (left) and at t = 5 (right). 

Finally, Figure [5] illustrates the performance as a function of the level of cooperation and the 
average degree of agents at t = 1 and at t = 5. Again, just as the trust between agents of the 
same profile increases in Figure [3l the performance increases with increasing average degree of 
agents and level of cooperation. One might wonder how, at t = 1, the performance can already 
be nonzero - this is due to the fact that there are only two opposite profiles; this implies that 
half of the neighbours of an agent are of the same profile and, as soon as an agent has developed 
some trust to one of these neighbours, it will benefit from their recommendations which, again, 
drives the performance up. 
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4.5 Empirical Validation 

To support the analytical approximations of the model and the results of the computer simula- 
tions, we empirically tested the performance of a recommender system using our Trust WebRank 
(TW) metric against one using a standard Collaborative Filtering (CF) approach, similarly to 
what has been done in [17| . We crawled Epinions.com, an on-line platform which allows consumers 
to read and write reviews about products. The unique feature of Epinions is that users can also 
form a "web-of-trust" and specify other users that they trust with respect to their reviews. The 
crawling was performed in mid-2007 and led to a dataset of 60,918 users with 896,969 reviews 
on 223,687 products and with 518,505 relationships. We cleaned this dataset and removed users 
that either had not written any reviews or had no relationships to other users because no rea- 
sonable validation can be done with these users. Furthermore, we focus on the greatest strongly 
connected component (SCC) because a) there is only one large SCC and many small SCC (1-3 
users) and b) membership in this SCC can be seen as a proxy for having a properly formed web 
of trust. Having applied this procedure, we are left with 29,478 users, 731,220 reviews on 201,674 
products, and 471,888 relationships. The data sparsity is 99.9877 stars (max). There is a bias to 
review favourably, as 75 ratings are either 4 or 5 stars and only 25 or 3 stars - probably because 
users are more likely to spend time to write a review when they like a product. 

We split the reviews into a training set i?Training and a test set Rtbsi- We then compare the 
performance of TW and CF by training the algorithms on -RTraining and testing with -Rxest- TW, 
in general, has comparable performance to CF, and performs better in particular situations, as 
we will describe in the following. The complete empirical validation will, together with some 
statistical analyses of the Epinions community, be reported on in a separate paper |26| . 

Mean Absolute Error: the mean absolute error (MAE) is defined as 



Figure M shows the MAE of TW for changing P and CF. Depending on the value of (3, TW 
performs (marginally) better than CF. There is an optimal /3opt ~ 0.8. 

However, the fact that most ratings are 4 or 5 limits the meaning of the MAE as a measure of 
performance. Indeed, predictions based on the Simple Average (SA) of ratings on a product, a 
global algorithm which is not personalised for users, outperform both TW and CF: euAEiSA) = 
0.21. Similar results were found in [17] using a different dataset of Epinions (from 2003). An 
explanation for this is that reviews are very homogeneous and almost all ratings are positive. 
Other datasets, such as the commonly used MovieLens dataset, are more heterogeneous and SA 
performs worse than CF on such datasets. Unfortunately, at the moment, Epinions is the only 




(26) 
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Figure 6: Mean Absolute Error of TW (blue/circles) against /3 and CF (red/squares). The 
MAE is normalised to a scale in [0, 1], i.e. it reflects percentages. 

available dataset which combines rating data and a social network - and which is thus suitable 
to test the performance of TW. 

Coverage: coverage measures the percentage of elements that can be predicted from the training 
set. Both TW and CF cannot compute predictions for all elements in the test set. For example, 
if there is no similar or trusted user who has rated a particular product, CF or TW are not able 
to compute a prediction for that product. CF was able to compute 41.65 75.11 with CF. The 
reason for this is that TW is able to reach a large neighbourhood even when the neighbourhood 
based on co-ratings, as in CF, is small. 

Top-N Set Overlap: as noted, the value of ratings in Epinions does not seem to carry a lot of 
meaning - probably because people tend to rely more on the text of reviews than on the rating. 
Therefore, it makes sense to compare the performance based on the ability to predict the subset 
of products rated by a user. We define the following measures of overlap between sets: 



where Pi is the set of products rated by a user i; is the set of the N most rated products 
overall in the system; X denotes either CF or TW and thus Rcfi ^-TWi ^'^^ ^^^^ 
N most rated products in the neighbourhood of a user i constructed by CF and TW. Note that 
is a global set which is the same for all users i. Thus, of is the counterpart of eMAE(5'^) in 
this context. Rfpi ^TW^i ^'^^ personalised sets which depend on the neighbourhood of user 
i and thus are different for any two users. We define the average overlap across all users as O^, 
Ofp, and O^^. For = 100, we obtain » 0.0819, Of,p ^ 0.2526 and O^^ ^ 0.1724. Since 
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a larger overlap signifies a better prediction, the larger the values, the better the performance. 
This implies that the global measure performs worse than both O^p and Oj^VK- addition, 
CF performs better than TW. However, it should be emphasised that this measure is obviously 
biased in favour of CF: by definition. Pi fl RcFi ^- contrast. Pi fl R^wi '^^'^ empty, as a 
user does not necessarily declare trust to people who are have rated the same items. Still, TW 
performs significantly better than the global measure . 

This illustrates the difficulty to compare the performance of TW with CF. In fact, the most 
appropriate way to measure performance would be based on user-provided feedback subsequent 
to having followed a recommendation. 

In conclusion, we found that TW and CF have comparable performance. TW seems mostly useful 
for recommendations of items different from those a user has already rated - e.g. recommenda- 
tions on travel books for people usually interested in tools for gardening. 

5 Extensions and Conclusion 

We introduced a novel metric for computing indirect trust in social networks. We derived this 
metric from feedback centrality measures in graphs and illustrated how it addresses some limita- 
tions of other trust metrics; most importantly, that it takes cycles in the underlying graph into 
account. We constructed a simple model of a recommender system that makes use of our metric 
and showed how indirect trust can be used to generate recommendations. We performed analyt- 
ical approximations and computer simulations to characterise the system behaviour. Finally, we 
also tested the model by validating it with empirical data of an Internet community devoted to 
product reviews. 

Some extensions to this model could involve changing the trust dynamics: 

Trust update as a slow- positive, fast-negative dynamics. It has been observed in the literature 
that trust follows a slow-positive, fast-negative dynamics [U [TOl [151 EH ES]. This means that 
trust builds up slowly, but gets torn down quickly and this behaviour could be implemented by 
modifying Eq. (fTSl) . 

Coupling the utility with the level of cooperation rj. In real applications, if, initially, the utility 
for users is zero, then nobody will signal and this is a fixed point - and a social dilemma 
Thus, we could couple the probability of signalling to the utility and investigate how to make 
the system escape from this undesirable fixed point. 

With this work, we have shown that incorporating this novel trust metric in recommender systems 
is a promising and viable approach. 
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